LTR_STRUC: a novel search and identification program for LTR retrotransposons
Abstract
MOTIVATION: Long terminal repeat (LTR) retrotransposons constitute a substantial fraction of most eukaryotic genomes and are believed to have a significant impact on genome structure and function. Conventional methods used to search for LTR retrotransposons in genome databases are labor intensive. We present an efficient, reliable and automated method to identify and analyze members of this important class of transposable elements.
RESULTS: We have developed a new data-mining program, LTR_STRUC (LTR retrotransposon structure program), which identifies and automatically analyzes LTR retrotransposons in genome databases by searching for structural features characteristic of such elements. LTR_STRUC has significant advantages over conventional search methods in the case of LTR retrotransposon families having low sequence homology to known queries or families with atypical structure (e.g. non-autonomous elements lacking canonical retroviral ORFs) and is thus a discovery tool that complements established methods. LTR_STRUC finds LTR retrotransposons using an algorithm that encompasses a number of tasks that would otherwise have to be initiated individually by the user. For each LTR retrotransposon found, LTR_STRUC automatically generates an analysis of a variety of structural features of biological interest.
AVAILABILITY: The LTR_STRUC program is currently available as a console application free of charge to academic users from the authors.
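The abstract does not spell out the algorithm, so the following is only an illustrative sketch of what a structure-based (rather than homology-based) search can look like. It assumes one structural feature, a pair of identical direct repeats separated by an internal region, and finds such pairs by seeding on short exact matches and extending them. The function name `find_ltr_candidates` and all of its parameters are hypothetical and are not taken from LTR_STRUC.

```python
from collections import defaultdict


def find_ltr_candidates(seq, seed_len=8, min_sep=100, max_sep=15000, min_ltr_len=12):
    """Find exact direct-repeat pairs that could flank an LTR element.

    Illustrative only: this is NOT the LTR_STRUC algorithm.
    Returns sorted (left_start, left_end, right_start, right_end) tuples
    using 0-based, half-open coordinates.
    """
    seq = seq.upper()

    # Index every seed-length word by its start positions.
    index = defaultdict(list)
    for pos in range(len(seq) - seed_len + 1):
        index[seq[pos:pos + seed_len]].append(pos)

    found = set()
    for positions in index.values():
        for i, a in enumerate(positions):
            for b in positions[i + 1:]:
                sep = b - a
                # The two seed copies must not overlap and must lie
                # within the allowed distance window.
                if sep < max(min_sep, seed_len) or sep > max_sep:
                    continue

                # Extend the match to the left while both copies agree.
                left = 0
                while (a - left > 0
                       and b - left - 1 >= a + seed_len
                       and seq[a - left - 1] == seq[b - left - 1]):
                    left += 1

                # Extend to the right without letting the copies overlap.
                length = seed_len
                while (b + length < len(seq)
                       and a + length < b - left
                       and seq[a + length] == seq[b + length]):
                    length += 1

                total = left + length
                if total >= min_ltr_len:
                    start_a, start_b = a - left, b - left
                    found.add((start_a, start_a + total, start_b, start_b + total))
    return sorted(found)
```

A real tool would also need to tolerate mismatches and indels between the two repeats and would check further structural signals. This sketch reports exact repeat pairs only and compares seed positions pairwise, so it is suited to short test sequences rather than whole genomes.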
Related articles
Mosquitoes LTR Retrotransposons: A Deeper View into the Genomic Sequence of Culex quinquefasciatus
A set of 67 novel LTR-retrotransposons has been identified by in silico analyses of the Culex quinquefasciatus genome using the LTR_STRUC program. The phylogenetic analysis shows that 29 novel and putatively functional LTR-retrotransposons detected belong to the Ty3/gypsy group. Our results demonstrate that, by considering only families containing potentially autonomous LTR-retrotransposons, the...
A Nest of LTR Retrotransposons Adjacent the Disease Resistance-Priming Gene NPR1 in Beta vulgaris L. U.S. Hybrid H20
A nest of long terminal repeat (LTR) retrotransposons (RTRs), discovered by LTR_STRUC analysis, is near core genes encoding the NPR1 disease resistance-activating factor and a heat-shock-factor-(HSF-) like protein in sugarbeet hybrid US H20. SCHULTE, a 10 833 bp LTR retrotransposon, with 1372 bp LTRs that are 0.7% divergent, has two ORFs with unexpected introns but encoding a reverse transcript...
Newly identified families of human endogenous retroviruses.
Human endogenous retroviruses (HERVs) make up approximately 8.3% of the human genome (12). HERVs have previously been classified into 31 distinct families based upon sequence alignment of reverse transcriptase (RT) and envelope domains and subsequent phylogenetic analyses (1, 9, 16). Using the data mining program LTR_STRUC (13) in conjunction with conventional sequence homology techniques, we r...
Getting an Evolutionary Handle on Life after Reproduction
Background: LTR retrotransposons are a class of mobile genetic elements containing two similar long terminal repeats (LTRs). Currently, LTR retrotransposons are annotated in eukaryotic genomes mainly through the conventional homology searching approach. Hence, it is limited to annotating known elements. Results: In this paper, we report a de novo computational method that can identify new LTR r...
MGEScan-non-LTR: computational identification and classification of autonomous non-LTR retrotransposons in eukaryotic genomes
Computational methods for genome-wide identification of mobile genetic elements (MGEs) have become increasingly necessary for both genome annotation and evolutionary studies. Non-long terminal repeat (non-LTR) retrotransposons are a class of MGEs that have been found in most eukaryotic genomes, sometimes in extremely high numbers. In this article, we present a computational tool, MGEScan-non-LT...
Journal:
- Bioinformatics
Volume 19, Issue 3
Publication date: 2003